(ii) TITLE OF INVENTION: PURIFIED EXPANSIN PROTEINS
(A) INTELLECTUAL PROPERTY OFFICE, THE PENNSYLVANIA 
STATE UNIVERSITY 
STREET: 113 TECHNOLOGY CENTER
CITY: UNIVERSITY PARK 
STATE: PENNSYLVANIA 

COUNTRY: UNITED STATES OF AMERICA 
ZIP: 16802-7000 
COMPUTER READABLE FORM: 
STRANDEDNESS: SINGLE
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GAC TAC GGT GGC TGG CAG AGC GGC CAC GCC ACC TTT TAT GGT 42 
Asp Tyr Gly Gly Trp Gln Ser Gly His Ala Thr Phe Tyr Gly
1 5 10

GGT GGT GAC GCA TCT GGC ACC ATG GGT GGA GCT TGT GGG TAT 84
Gly Gly Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr
15 20 25



GGG AAT TTA TAC AGC CAA GGG TAT GGC ACG AAC ACG GTG GCG 126
Gly Asn Leu Tyr Ser Gln Gly Tyr Gly Thr Asn Thr Val Ala
30 35 40



S» f° GCG CTA ^ AAC AAT GGA TTA AGT TGT GGT GCT 168 

Leu Ser Thr Ala Leu Phe Asn Asn Gly Leu Ser Cys Gly Ala
45 50 55



TGC TTC GAA ATG ACT TGT ACA AAC GAC CCT AAA TGG TGC CTT 210 
Cys Phe Glu Met Thr Cys Thr Asn Asp Pro Lys Trp Cys Leu
60 65 70 

CCG GGA ACT ATT AGG GTC ACT GCC ACC AAC TTT TGC CCT CCT 252
Pro Gly Thr Ile Arg Val Thr Ala Thr Asn Phe Cys Pro Pro
75 80

AAC TTT GCT CTC CCT AAC AAC AAT GGT GGA TGG TGC AAC CCT 294 
Asn Phe Ala Leu Pro Asn Asp Asp Gly Gly Trp Cys Asn Pro
85 90 95



CCT CTC CAA CAC TTC GAC ATG GCT GAG CCT GCC TTC CTT CAA 336
Pro Leu Gln His Phe Asp Met Ala Glu Pro Ala Phe Leu Gln
100 105 110

ATC GCT CAA TAC CGA GCT GGT ATC GTC CCC GTC TCC TTT CGT 378
Ile Ala Gln Tyr Arg Ala Gly Ile Val Pro Val Ser Phe Arg
115 120 125



AGG GTA CCA TGT ATG AAG AAA GGT GGA GTG AGG TTT ACA ATC 420 
Arg Val Pro Cys Met Lys Lys Gly Gly Val Arg Phe Thr Ile
130 135 140



AAT GGC CAC TCA TAC TTC AAC CTC GTT TTG ATC ACA AAC GTC 462 
Asn Gly His Ser Tyr Phe Asn Leu Val Leu Ile Thr Asn Val

145 150

GGT GGC GCA GGC GAC GTC CAC TCT GTG TCG ATA AAG GGG TCT 504
Gly Gly Ala Gly Asp Val His Ser Val Ser Ile Lys Gly Ser
155 160 165 

CGA ACT GGA TGG CAA TCC ATG TCT AGA AAT TGG GGC CAA AAC 546 
Arg Thr Gly Trp Gln Ser Met Ser Arg Asn Trp Gly Gln Asn
170 175 180

TGG CAA AGC AAC AAC TAT CTC AAT GGC CAA GGC CTT TCC TTT 588
Trp Gln Ser Asn Asn Tyr Leu Asn Gly Gln Gly Leu Ser Phe
185 190 195

CAA GTC ACT CTT AGT GAT GGT CGC ACT CTC ACT GCC TAT AAT 630
Gln Val Thr Leu Ser Asp Gly Arg Thr Leu Thr Ala Tyr Asn
200 205 210

CTC GTT CCT TCC AAT TGG CAA TTT GGC CAA ACC TAT GAA GGC 672
Leu Val Pro Ser Asn Trp Gln Phe Gly Gln Thr Tyr Glu Gly

215 220 

CCT CAA TTC 681 

Pro Gln Phe

225 

(3) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 

(B) TYPE: AMINO ACID 

(D) TOPOLOGY: UNKNOWN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Ala Gly Gly Gly Trp Val Asn Ala His Ala Thr Phe Tyr Gly Gly

1 5 10 15

Gly Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn

20 25 30

Leu Tyr Ser Gln Gly Tyr Gly Thr Asn Thr Ala Ala Leu Ser Thr

35 40 45



Ala Leu Phe Asn Asn Gly Leu Ser Cys Gly Ala Cys Phe Glu Ile

50 55 60 






Arg Cys Gln Asn Asp Gly Lys Trp Cys Leu Pro Gly Ser Ile Val

65 70 75

Val Thr Ala Thr Asn Phe Cys Pro Pro Asn Asn Ala Leu Pro Asn

80 85 90



Asn Ala Gly Gly Trp Cys Asn Pro Pro Gln Gln His Phe Asp Leu

95 100 105

Ser Gln Pro Val Phe Gln Arg Ile Ala Gln Tyr Arg Ala Gly Ile

110 115 120

Val Pro Val Ala Tyr Arg Arg Val Pro Cys Val Arg Arg Gly Gly

125 130 135

Ile Arg Phe Thr Ile Asn Gly His Ser Tyr Phe Asn Leu Val Leu

140 145 150

Ile Thr Asn Val Gly Gly Ala Gly Asp Val His Ser Ala Met Val

155 160 165

Lys Gly Ser Arg Thr Gly Trp Gln Ala Met Ser Arg Asn Trp Gly

170 175 180

Gln Asn Trp Gln Ser Asn Ser Tyr Leu Asn Gly Gln Ser Leu Ser

185 190 195

Phe Lys Val Thr Thr Ser Asp Gly Gln Thr Ile Val Ser Asn Asn

200 205 210

Xaa Ala Asn Ala Gly Trp Ser Phe Gly Gln Thr Phe Thr Gly Ala

215 220 225 

His Val Arg 

(4) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 

(B) TYPE: AMINO ACID 

(D) TOPOLOGY: UNKNOWN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

His Met Gly Pro Trp Ile Asn Ala His Ala Thr Phe Tyr Xaa Xaa
1 5 10 15

Gly Asp Ala Xaa Xaa Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn
20 25 30

Leu Tyr Ser Gln Gly Tyr Gly Leu Glu Thr Ala Ala Leu Ser Thr

35 40 45

Ala Leu Phe Asp Gln Gly Leu Ser Cys Gly Ala Cys Xaa Glu Leu

50 55 60

Met Cys Val Asn Asp Pro Gln Trp Cys Ile Lys Gly Arg Ser Ile

65 70 75

Val Val Thr Ala Thr Asn Phe Cys Pro Pro Gly Gly Ala Cys Asp

80 85 90

Pro Pro Asn His His Phe Asp Leu Ser Gln Pro Ile Tyr Glu Lys
95 100 105

Ile Ala Leu Tyr Lys Ser Gly Ile Ile Pro Val Met Tyr Arg Arg

110 115 120

Val Arg Cys Lys Arg Ser Gly Gly Ile Arg Phe Thr Ile Asn Gly

125 130 135

His Ser Tyr Phe Asn Leu Val Leu Val Thr Asn Val Gly Gly Ala

140 145 150

Gly Asp Val His Ser Val Ser Met Lys Gly Ser Arg Thr Lys Trp
155 160 165

Gln Leu Met Ser Arg Asn Trp Gly Gln Asn Trp Gln Ser Asn Ser
170 175 180

Tyr Leu Asn Gly Gln Ser Leu Ser Phe Val Val Thr Thr Ser Asp

185 190 195

Arg Arg Ser Val Val Ser Phe Asn Val Ala Pro Pro Thr Trp Ser

200 205 210

Phe Gly Gln Thr Tyr Thr Gly Gly Gln Phe Arg Tyr

215 220 



(5) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 227

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Xaa Ser Val Ala Gln Ser Ala Phe Ala Thr Phe Tyr Gly Gly
1 5 10 15

Lys Asp Gly Ser Cys Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn 

20 25 30 



Leu Tyr Asn Ala Gly Tyr Gly Leu Tyr Asn Ala Ala Leu Ser Ser 

35 40 45

Ala Leu Phe Asn Asp Gly Ala Met Cys Gly Ala Cys Tyr Thr Ile

50 55 60

Thr Cys Asp Thr Ser Gln Thr Lys Trp Cys Lys Pro Gly Gly Asn

65 70 75

Ser Ile Thr Ile Thr Ala Thr Asn Leu Cys Xaa Pro Asn Trp Ala

80 85 90



Leu Pro Ser Asn Ser Gly Gly Trp Cys Asn Pro Pro Leu Xaa His 

95 100 105 

Phe Asp Met Ser Gln Pro Ala Trp Glu Asn Ile Ala Val Tyr Gln
110 115 120

Ala Gly Ile Val Pro Val Asn Tyr Lys Arg Val Pro Xaa Gln Arg

125 130 135

Ser Gly Gly Ile Arg Phe Ala Ile Ser Gly His Asp Tyr Phe Glu

140 145 150

Leu Val Thr Val Thr Asn Val Gly Gly Ser Gly Val Val Ala Gln

155 160 165

Met Ser Ile Lys Gly Ser Asn Thr Gly Trp Met Ala Met Ser Arg

170 175 180

Asn Trp Gly Ala Asn Trp Gln Ser Asn Ala Tyr Leu Ala Gly Gln
185 190 195

Ser Leu Ser Phe Ile Val Gln Leu Asp Asp Gly Arg Lys Val Thr

200 205 210 

Ala Trp Asn Xaa Ala Pro Xaa Asn Trp Leu Xaa Xaa Xaa Xaa Xaa 

215 220 225 

Xaa Xaa

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: UNKNOWN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Asn Gly Gly Trp Glu Arg Gly His Ala Thr Phe Tyr Gly Gly 
1 5 10 15



Ala Asp Ala Ser Gly Thr Met Gly Gly Ala Cys Gly Tyr Gly Asn 

20 25 30



Leu His Ser Gln Gly Tyr Gly Leu Gln Thr Ala Ala Leu Ser Thr

35 40 45

Ala Leu Phe Asn Ser Gly Gln Lys Cys Gly Ala Cys Phe Glu Leu

50 55 60

Thr Cys Glu Asp Asp Pro Glu Trp Cys Ile Pro Gly Ser Ile Ile

65 70 75 



Val Arg Tyr Asn Leu Ala Asn Phe Ala Leu Ala Asn Asp Asn Gly 

80 85 90 

Gly Trp Cys Asn Pro Pro Leu Lys His Phe Asp Leu Ala Glu Pro 
95 100 105

Ala Phe Leu Gln Ile Ala Gln Tyr Arg Ala Gly Ile Val Pro Val

110 115 120

Ala Phe Arg Arg Val Pro Cys Glu Lys Gly Gly Gly Ile Arg Phe

125 130 135

Thr Ile Asn Gly Asn Pro Tyr Phe Asp Leu Val Leu Ile Thr Asn

140 145 150

Val Gly Gly Ala Gly Asp Ile Arg Ala Val Ser Leu Lys Gly Ser

155 160 165

Lys Thr Asp Gln Trp Gln Ser Met Ser Arg Asn Trp Gly Gln Asn
170 175 180

Trp Gln Ser Asn Thr Tyr Leu Arg Gly Gln Ser Leu Ser Phe Gln

185 190 195 

Val Thr Asp Ser Asp Gly Arg Thr Val Val Ser Tyr Asp Val Val 

200 205 210 

Pro His Asp Trp Gln Phe Gly Gln Thr Phe Glu Gly Gly Gln Phe

215 220 225 



(7) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 226

(B) TYPE: AMINO ACID
(D) TOPOLOGY: UNKNOWN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Asp Tyr Ser Ser Trp Gln Ser Ala His Ala Thr Phe Tyr Gly Gly

1 5 10 15

Gly Asp Ala Ser Gly Thr Met Gly Gly Thr Cys Gly Tyr Gly Asn

20 25 30

Leu Tyr Ser Thr Gly Tyr Thr Asn Thr Ala Ala Leu Ser Thr Val 

35 40 45 



Leu Phe Asn Asp Gly Ala Ala Cys Arg Ser Cys Tyr Glu Leu Arg

50 55 60

Cys Asp Asn Asp Gly Gln Trp Cys Leu Pro Gly Ser Val Thr Val

65 70 75 

Thr Ala Thr Asn Leu Cys Pro Pro Asn Tyr Ala Leu Pro Asn Asp 
80 85 90

Asp Gly Gly Trp Cys Asn Pro Pro Arg Pro His Phe Asp Met Ala 

95 100 105 

Glu Pro Ala Phe Leu Gln Ile Gly Val Tyr Arg Ala Gly Ile Val

110 115 120

Pro Val Ser Tyr Arg Arg Val Pro Cys Val Lys Lys Gly Gly Ile

125 130 135

Arg Phe Thr Ile Asn Gly His Ser Tyr Phe Asn Leu Val Leu Val

140 145 150

Thr Asn Val Ala Gly Pro Gly Asp Val Gln Ser Val Ser Ile Lys
155 160 165

Gly Ser Ser Thr Gly Trp Gln Pro Met Ser Arg Asn Trp Gly Gln

170 175 180

Asn Trp Gln Ser Asn Ser Tyr Leu Asp Gly Gln Ser Leu Ser Phe

185 190 195

Gln Val Ala Val Ser Asp Gly Arg Thr Val Thr Ser Asn Asn Val

200 205 210 



Val Pro Ala Gly Trp Gln Phe Gly Gln Thr Phe Glu Gly Gly Gln

215 220 225 



Phe 